Better Safe Than Sorry: Preventing Delusive Adversaries with Adversarial Training
Delusive attacks aim to substantially degrade the test accuracy of a learning model by slightly perturbing the features of correctly labeled training examples. By formalizing this malicious attack as finding the worst-case training data within a specific $\infty$-Wasserstein ball, we show that minimizing adversarial risk on the perturbed data is equivalent to optimizing an upper bound of natural risk on the original data. This implies that adversarial training can serve as a principled defense against delusive attacks, so the test accuracy lost to delusive attacks can be largely recovered by adversarial training. To further understand the internal mechanism of the defense, we show that adversarial training resists delusive perturbations by preventing the learner from relying excessively on non-robust features in a natural setting. Finally, we complement our theoretical findings with a set of experiments on popular benchmark datasets, showing that the defense withstands six different practical attacks. Both theoretical and empirical results vote for adversarial training when confronted with delusive adversaries.
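The bound claimed in this abstract can be written out roughly as follows; the notation below (natural risk $\mathcal{R}_{\mathrm{nat}}$, adversarial risk $\mathcal{R}_{\mathrm{adv}}$, perturbation budget $\epsilon$, $W_\infty$ ball) is a sketch assumed for illustration, not lifted from the paper.

```latex
% Rough rendering of the claimed guarantee: adversarial risk on the
% (possibly poisoned) data upper-bounds natural risk on the clean data.
% Notation (\ell, \epsilon, B_{W_\infty}) is assumed for illustration.
\begin{align*}
  \mathcal{R}_{\mathrm{nat}}(f;\mathcal{D})
    &= \mathbb{E}_{(x,y)\sim\mathcal{D}}\bigl[\ell(f(x),y)\bigr] \\
  \mathcal{R}_{\mathrm{adv}}(f;\widehat{\mathcal{D}})
    &= \mathbb{E}_{(x,y)\sim\widehat{\mathcal{D}}}
       \Bigl[\max_{\|\delta\|\le\epsilon}\ell\bigl(f(x+\delta),y\bigr)\Bigr] \\
  \mathcal{R}_{\mathrm{nat}}(f;\mathcal{D})
    &\le \mathcal{R}_{\mathrm{adv}}(f;\widehat{\mathcal{D}})
     \quad\text{for any } \widehat{\mathcal{D}}\in B_{W_\infty}(\mathcal{D},\epsilon)
\end{align*}
```

Intuitively, if every poisoned example lies within $\epsilon$ of its clean counterpart, the inner maximization of adversarial training already covers the clean example, which is why minimizing the adversarial risk on the perturbed data controls the natural risk on the original data.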
Better Safe than Sorry: Pre-training CLIP against Targeted Data Poisoning and Backdoor Attacks
Yang, Wenhan, Gao, Jingdong, Mirzasoleiman, Baharan
Contrastive Language-Image Pre-training (CLIP) on large image-caption datasets has achieved remarkable success in zero-shot classification and enabled transferability to new domains. However, CLIP is far more vulnerable to targeted data poisoning and backdoor attacks than supervised learning. Perhaps surprisingly, poisoning 0.0001% of CLIP pre-training data is enough to make targeted data poisoning attacks successful, four orders of magnitude less than what is required to poison supervised models. Despite this vulnerability, existing methods are very limited in defending CLIP models during pre-training. In this work, we propose a strong defense, SAFECLIP, to safely pre-train CLIP against targeted data poisoning and backdoor attacks. SAFECLIP warms up the model by applying unimodal contrastive learning (CL) to the image and text modalities separately. Then, it carefully divides the data into safe and risky subsets. SAFECLIP trains on the risky data by applying unimodal CL to the image and text modalities separately, and trains on the safe data using the CLIP loss. By gradually increasing the size of the safe subset during training, SAFECLIP effectively breaks targeted data poisoning and backdoor attacks without harming CLIP performance. Our extensive experiments show that SAFECLIP decreases the attack success rate of targeted data poisoning attacks from 93.75% to 0% and that of backdoor attacks from 100% to 0%, without harming CLIP performance on various datasets.
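The training schedule described in this abstract can be summarized with a short sketch. The helpers below (unimodal_cl_loss, clip_loss, split_safe_risky) are hypothetical placeholders supplied by the caller, not the authors' actual code.

```python
# Hypothetical sketch of the SAFECLIP schedule described in the abstract.
# The loss and splitting helpers are caller-supplied placeholders, not the
# authors' implementation.
def safeclip_train(model, optimizer, loader, *, warmup_epochs, epochs,
                   safe_frac, growth, unimodal_cl_loss, clip_loss,
                   split_safe_risky):
    def step(loss):
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()

    # 1) Warm-up: contrastive learning within each modality separately,
    #    so poisoned image-caption pairs are never pulled together.
    for _ in range(warmup_epochs):
        for images, texts in loader:
            step(unimodal_cl_loss(model.image_encoder, images)
                 + unimodal_cl_loss(model.text_encoder, texts))

    for _ in range(epochs):
        # 2) Partition pairs into safe / risky subsets.
        safe_set, risky_set = split_safe_risky(model, loader, safe_frac)

        # 3) Safe pairs are trained with the usual cross-modal CLIP loss.
        for images, texts in safe_set:
            step(clip_loss(model, images, texts))

        # 4) Risky pairs only receive unimodal contrastive learning.
        for images, texts in risky_set:
            step(unimodal_cl_loss(model.image_encoder, images)
                 + unimodal_cl_loss(model.text_encoder, texts))

        # 5) Gradually enlarge the safe subset as training progresses.
        safe_frac = min(1.0, safe_frac + growth)
```

The key design choice visible in the sketch is that a potentially poisoned caption is never contrasted against its paired image until the pair has graduated into the safe subset.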
Opinion: When It Comes to AI, Better Safe Than Quick
Some people got a good laugh when a chatbot was given its own Twitter account – and was then transformed into a Holocaust-denying racist within 24 hours and swiftly taken offline. The Microsoft chatbot – called Tay – is a piece of software that can communicate with others without human involvement. The bot was equipped with artificial intelligence but was easily manipulated by Twitter users. Teaching a bot works much like teaching an innocent, unknowing child: artificial intelligence learns from the sum of its experiences, much as human beings do. The more often a subject is talked about, opinions are expressed, and certain wordings are used, the more likely the software is to consider them normal and adopt them.
- Information Technology > Communications > Social Media (0.96)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.59)
Better safe than sorry: Risky function exploitation through safe optimization
Schulz, Eric, Huys, Quentin J. M., Bach, Dominik R., Speekenbrink, Maarten, Krause, Andreas
Exploration-exploitation of functions, that is, learning and optimizing a mapping between inputs and expected outputs, is ubiquitous in many real-world situations. These situations sometimes require us to avoid certain outcomes at all costs, for example because they are poisonous, harmful, or otherwise dangerous. We test participants' behavior in scenarios in which they have to find the optimum of a function while avoiding outputs below a certain threshold. In two experiments, we find that Safe-Optimization, a Gaussian Process-based exploration-exploitation algorithm, describes participants' behavior well, and that participants seem to first check whether a point is safe and then try to pick the optimal point from among the safe ones. This means that their trade-off between exploration and exploitation can be seen as an intelligent, approximate, and homeostasis-driven strategy.
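The "safe first, then optimal" rule described in this abstract can be sketched with a generic Gaussian-process heuristic in the spirit of Safe-Optimization; the confidence-bound construction below is an assumption for illustration, not the authors' exact algorithm.

```python
# Generic "safe first, optimal second" selection rule, sketched with a
# Gaussian process; an illustrative SafeOpt-style heuristic only.
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor

def pick_next_point(X_obs, y_obs, X_candidates, threshold, beta=2.0):
    gp = GaussianProcessRegressor().fit(X_obs, y_obs)
    mu, sigma = gp.predict(X_candidates, return_std=True)

    # Step 1: a candidate counts as "safe" if even its pessimistic
    # estimate stays above the threshold to be avoided.
    lcb = mu - beta * sigma
    safe = lcb >= threshold
    if not safe.any():
        return X_candidates[np.argmax(lcb)]  # least risky fallback

    # Step 2: among safe candidates, pick the most promising one
    # according to an optimistic (upper confidence bound) estimate.
    ucb = mu + beta * sigma
    ucb[~safe] = -np.inf
    return X_candidates[np.argmax(ucb)]
```

Here safety is checked before optimality, mirroring the behavior the study attributes to participants: exploration happens only within the set of points believed to clear the threshold.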